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We develop a formalism for single molecule dynamic force spectroscopy to map the 
energy landscape of protein-protein complex {P1P2). The joint distribution P(t\,T2) 
of unbinding lifetimes t\ and T2 measurable in a compression-tension cycle, which 
accounts for the internal relaxation dynamics of the proteins under tension, shows 
that the histogram of t\ is not Poissonian. The theory is applied to the forced 
unbinding of protein P\, modeled as a wormlike chain, from P1P2. We propose a 
new class of experiments which can resolve the effect of internal protein dynamics 
on the unbinding lifetimes. 

Many biological functions are mediated by interactions between biomolecules under mechan- 
ical stress. Protein-DNA interactions involve force-induced motion of proteins . Similarly, 
specific protein-protein interaction in cell-protein complexes are important in molecular recog- 
nition j^l- Dynamic force spectroscopic techniques probe these interactions by forced unbinding 
of protein-protein complexes using forces in the lpN—lOOpN range and can be used to map the 



complex energy landscape underlying protein-protein association 



imm. 



Atomic force 



microscopy (AFM) has been employed in the studies of protein-protein interactions involving 
immunoglobulins ^ , molecular motors 0, C| and cell adhesion complexes 

BBS. 

In constant force-induced unbinding of single protein-protein complexes, the histograms of 
unbinding lifetimes is fit using the Poisson distribution 

P u (r; f ext ) = fciQfext) exp [-k^f^r]. (1) 



* Corresponding author phone: 301-405-4803; fax: 301-314-9404; thirum@glue.umd.edu 



2 



The dependence of the unbinding rate constant fci =1/ t u (t u is the lifetime of the complex) on the 
external force f ext is given by the Bell model [11], ki(f ext )=ki exp [f ext (J /ksT]. The parameter 



a is the maximum protein-protein bond extension before rupture, and fcio is the force-free 
unbinding rate of the bound complex P\P2- Because the Poisson approximation ignores the 
intrinsic dynamics of proteins (i.e. conformational motions and rearrangements), this analysis 
can only be used when t u exceeds the timescale of internal protein motion, tr. However, 
NMR studies of relaxation dynamics of proteins show that tr of single chain proteins ranges 
from nanoseconds to tens of miliseconds Q|. Lifetime measurements of a single P-selectin 
receptor with specific ligand PSGL-1 show that r u varies between miliseconds and few seconds 
depending on the magnitude of f ext 3, f|. Because the lifetimes of the protein-protein complex 
under force become comparable to tr, the interpretation of the unbinding data is complicated 
by protein motion. Thus, Eq. (0) cannot be used to describe experimental histograms of the 
lifetimes. To account for the competing timescales (tr and t u ) a theoretical framework that 
probes correlations between intrinsic relaxation and unbinding dynamics is needed to analyze 
experimental data. 

In typical AFM experiments, the cantilever tip coated with protein Pi is brought into contact 
with the surface-attached protein P2, and allowed to interact for a time At so that the complex 
P1P2 can form (compression cycle). The tip is then retracted to a prescribed distance which 
results in the complex feeling a constant force f=/ eX iX in the direction x perpendicular to the 
surface (tension cycle). The lifetime r at which P\P<i bond breaks is recorded. However, if t u ~tr, 
there is a finite time (~Tr) for propagation of the constant tension from the pulled terminus of 
Pi to the binding interface of the P1P2 complex. Thus, the average time r u to break the P1P2 
bond (assuming that cantilever spring constant is stiff compared with the non-covalent linkages 
that stabilize P\ and P2), is enhanced by tr resulting in the "apparent" lifetime t^tr+t u of 
the complex. 

In this Letter we propose a novel theoretical methodology for describing forced unbinding 
which allows for accurate estimation of protein-protein interaction parameters. The approach 
is based on analyzing not only the distribution of single lifetimes P(t) but also the joint distri- 
bution P(t\,T2] At) of lifetimes T\ and T2 separated by compression time At. The distribution 
P{ti,T2] At) is measurable by constructing the joint histogram of lifetimes using current exper- 
imental methods. Because in current AFM assays At can be as short as microseconds [13], At 
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can be varied by changing the frequency of the compression cycle; P(ti, t 2 ; At) can be utilized 
to resolve tr which in turn can be used to obtain t u , and free-energy landscape parameters a 
and k w . The theory describes protein-protein complexes that obey Pi+P 2 ^ ± P\P2, and can be 
extended to more elaborate kinetic and protein models. 

Basic concepts: Typically, for specific protein-protein complexes the binding rate for 
Pi+P 2 ^PiP2 is fast, and At is controlled by the duration of the compression cycle. Because of 
the conformational fluctuations of Pi, the binding interface experiences a restoring force f(X, t) 
which tends to decrease the end-to-end distance X(t). As t increases, the unbinding force along 
the coordinate X increases so that f{X,t)^>f ext as t—>-oo, and X(t) approaches the equilib- 
rium force-dependent value (X(f ext )). Due to the conformational dynamics of the proteins, 
the unbinding rate, k\(X, t)=kioexp [af(X,t)/kBT], is a stochastic variable that depends on X 
through f(X). When application of f ext does not result in complete stretching of P\ (X=L), 
the instantaneous value of force along the P1P2 bond is equal to the restoring force 

f(X,r) = -k B T p{x ^ ^ (2) 

where the probability that P\ has end-to-end distance X at time t is given by 
P(X, t)=jjrgj oIXqAt:XqGq{X^ t; Xo)il) eq (Xo) and N(t) is a normalization constant. When 
f ext is large to fully stretch P 1 , the force felt by P1P2 bond spikes up to f ext at X=L, i.e. 
f=f(X,r)h(L — X)+f ext h(X — L), where h(X) is the Heavyside step function. In this Let- 
ter we consider f ext that does not exceed the unfolding force threshold. The unbinding time 
distribution is given by the convolution of unbinding kinetics and dynamics of X, i.e., 

P(r,f ext ) = -J- f ' dX^Xl / L rfX 4 7 rX 2 P u (X 1 ,r)G /c:ct (X 1 ,r;X )^ e(? (Xo) (3) 
iV i Jo Jo 

where iVi is a normalization constant. In Eqs. (j2J) and ©, G (Xi,t; X ) and Gf ext (Xi, t; X ) 
are respectively the force-free and force- dependent conditional probability of X at time t and 
ijj eq (X) is the equilibrium distribution of X. The unbinding probability P u (X,t) depends on X 
through fci, i.e. P U (X, t)=ki(X, t)exp [—ki(X, t)t]. The above equation is a generalization of Eq. 
(JIJ) for force exerted on P1P2 bond that continuously evolves from zero to f=f e xt over time tr. 
In the limit t r <^t u , P(r, f ext ) reduces to P u (r, f ext ) given by Eq. (JTJ). 

A model application: To illustrate the consequences of the stochastic nature of k\(X, t), which 
reflects the underlying dynamics of proteins, we assume that a thermally fluctuating worm-like 



4 



chain (Pi) is in contact with the immobile P 2 . Upon application of force f ext , extension of Pi 
results in unbinding. In this example, the timescale for internal modes of Pi are comparable to 
the unbinding lifetime. The Hamiltonian for Pi is 



H 



3k B T 
3k h T 



-L/2 V ds 



dr(-L/2,t) 
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(4) 



ds 



dr(s,t) 
ds 



where l p is the protein persistence length and r(s,t) is the location of protein monomers 
—L/2<s<L/2 at time t. The end-to-end vector is X(t)=r(L/2, t)— r(— L/2, t), where L is the 
protein contour length. The statistics of X can be represented by a large number of independent 
modes when L/Z P 3>1. Thus, it is reasonable to assume that Go(X, t;X Q ) is a Gaussian. In the 
overdamped limit, when f ext exceeds the unfolding threshold force, stretching of Pi is smooth 
and thus, preserves Gaussian statistics, 

3/2 



G (X, t; X ) 



1 



exp 



3(X - <f)(t)X ) 2 
'2(X*)(l -4?(t)) 



(5) 



2n(X 2 )) (l-0 2 (t)) 3 / 2 

specified by the mean value (X(t)) = 4>(t)X and variance a 2 =(X 2 ) — (X) 2 , where the correla- 
tion function (f)(t) = (X(t)X(0))/ (X 2 ). To construct G (X, t; X Q , 0) we compute (X(t)X(0)) and 
(X 2 )=lim^ oo (X(t)X(0)) with f ex t=Q- By using Eq. (jlj) and assuming that the dynamics of 
the worm-like chain in the overdamped random media is described by a stochastic force £(s,t) 
with white noise statistics, (£ a (s, t))=0 and (£ a (s, t)£p(s', i/)) =2'~/k B T8 a p8(s — s')5(t —t'), where 
a=x,y,z and 7 is the friction coefficient, we arrive at the Langevin equation: 



(6) 



where e=3l p k B T/4: and v="ik B T j2l v . We solve Eq. © for r(s, t) with boundary conditions 



[2i/£r-e^r] ±L/2 =0, [2z/ f r±e^r] ±L/2 =0, where i/ =3fc B T/4 to yield 



a 2 



(X(t)X(0)) = 12fc B T^-^ 2 (L/2)e 

where the odd eigenf unctions are ^| 

^n(s) = \/c n /L 



n 



l,3,...,2g+l 



(7) 



71=1 



cos [a n L/2] 



sin a„s 



A, 



cosh[/5 n L/2] 



sinh [j3 n s\ 
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with normalization constant c n . The eigenvalues z n =ea^ l +2va 2 and the constants a n , f3 n are ob- 
tained by solving a„sin [^]cosh /^cos [Sf^]sinh ±(a 2 n + /^)cos [^] cosh [A^]=0 
and j3 2 — a 2 = -k. In the limit, L/L— >oo, we arrive at the Rouse chain model describing the 
stretching modes ^p^■=^ ] /2/Lsm(n^Ts/L) with eigenvalues z^=3n 2 n 2 kBT /2l p L 2 . To construct 
force-dependent propagator G f ext (X , t; X ) , we integrate Eq. © with f ext x added to £(s,t) to 

obtain (X 2 ) fex =(X 2 )o+ fLEZM(L/2)/4- 

We computed P(r, f ext ) by integrating Eq. (jHJ) at room temperature. The parameters L, l p 
and j=kBT/ DL determine the timescale of protein motion r^max^ / z n } . We set /cio=0.1/is _1 , 
a—l.Onm, L=80nm, l p =0Anm and D=10~ 8 cm 2 / s. The largest eigenvalue z\ / / y=0. 2 fis -1 deter- 
mines the longest relaxation timescale tr£=:5//s. In left panels of Figure 1 we compare P(r, f ext ) 
for WLC and Rouse model (Eq. Q) with the Poisson approximation P u (r, f ext ) (Eq. ([!])) 
for f ext =lpN, 3pN and lOpN. At f ext =3pN and lOpiV, P(r, J^) for WLC model is in good 
agreement with P(r, f ext ) computed for the Rouse model. A slight overestimate in P(r) at short 
r's and lower f ext =lpN is due to faster relaxation of the Rouse modes. For fci~Zi/7, Poisson 
approximation P u (t) deviates noticeably from P(t). Deviations grow as / ex t is increased from 
lpN to lOpiV; P u (t) overestimates P(r) at shorter r and underestimates P(r) at longer r, 
predicting shorter lifetimes. Therefore, in cases when protein conformational relaxation and 
unbinding dynamics occur on similar timescales the use of Poisson approximation leads to in- 
accurate estimates of k\o and a. In the right panels of Figure 1 we compare P(r, f ext ) for the 
WLC and Rouse modes with Poisson approximation P u (r, f ex t) for zi/ r y=2fis~ 1 '^>ki . A tenfold 
increase in z\/j corresponds to less overdamped environment with larger D=10~ 7 cm 2 / s (the 
other parameters are same as in left panels). Because, it now takes an order of magnitude shorter 
time to propagate f ext from the pulled end of Pi to the P1P2 interface, Poisson distribution P u 
follows closely P(r,f ext ) at lower f ext =lpN and 3pN. However, P u deviates from P(t, f ex t) at 
higher f ext =10pN due to rapid force-induced increase in the unbinding rate k\. Thus, even when 
propagation of tension is rapid there are substantial deviations from Poisson distribution of bond 
lifetimes at higher f ext . 

A practical methodology that can be used in conjunction with experimental data to accurately 
estimate of k w and a is required. Dynamical signatures of protein motion can be assessed 
by computing the joint distribution P(ri, r 2 ; At) of consecutive unbinding times, T\ and r 2 , 
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separated by compression time At, 

P{n,r 2 ;At,f ext ) = [ L dX 3 47iXl [ L dX 2 4nX 2 2 f dX^nX* t dX 4nX* (9) 

iV 2 Jo Jo Jo Jo 

x P U (X 3 , T 2 )G fext (X 3 , r 2 ; X 2 )P b (X 2 , At)G (X 2 , At; X,) 
x P u (X 1 ,r 1 )G' /ext (X 1 ,T 1 ;X )^(Xo) 

where Pbif) is the binding probability for Pi-\-P 2 ^P\P 2 and N 2 is a normalization constant. In 
Eq. (|9~J). (^(A^, t; Xi) is the force free propagator representing correlations of two interaction 
events decaying over tr. When TR>At, P(ti,t 2 ; At) is a sensitive measure of protein motion 
and thus, can be employed to estimate tr. When r R <^At, unbinding events are independent, 
\im At ^ 00 Go(X 2 ,At;X 1 )^ eq (X 2 ), and hence, P(n, r 2 )— >P(ti)P(t 2 ). 

We computed P(ti,t 2 ; At) for At—l/^s^j/zi, At=10/is~7 / z\ and At=500yus^>7/,2i for 
f ext =3.0pN and fcio=0.1//s -1 , cr=1.0nm, L=80nm, l p =0Anm and £1/7=0. Ol/is -1 (Fig. 2). 
We assumed that protein binding (P\-\-P 2 ^PiP 2 ) is independent of the dynamics of X, i.e. 
once Pi reached the vicinity of binding interface of P 2 it binds, and set P&(A", At)=p,=l in 
Eq. (jnj). A short At=lfis and 10/is peak in P(ti, t 2 ) (top and middle panels) is washed out at 
longer At=500/zs (bottom). Striking asymmetry of the contour plots at short At is due to the 
dependence of shorter T2-events on longer Ti-events. During the first interaction the constant 
force felt by P1P2 bond is ramped up from f=0 to f—f ex t following the restoring force f(X,t) 
thus, prolonging T\. When At<CTR~7/zi, the next binding event takes place (at t=At after 
the first unbinding) when Pi is partially or fully stretched. As a result, the binding interface 
experiences non-vanishing restoring force from the beginning of the second interaction event 
and t 2 <t\. Contour plots of P(ti, t 2 ) become more symmetric as At is increased to 10/zs which 
implies growing statistical independence of unbinding events. At At=500fis^>r R , P{ti,t 2 ) is 
symmetric density, which results in factorization P(ti, t 2 )=P(ti)P(t 2 ). Thus, to obtain sta- 
tistically meaningful distributions of uncorrected unbinding times, unbinding events must be 
separated by much longer At compared to tr whose a priori determination is difficult. 

Application to Experiments: Using D(t\, t 2 ; At)=P(ri, r 2 ; At)— P(ri)P(r 2 ), correlations be- 
tween n's and r 2 's can be probed in AFM experiments. If P^O, the unbinding events are 
influenced by conformational fluctuations of the protein. For the model parameters in Fig. 2 
we show in Fig. 3 P(ti = r 2 ; At) for At=l/xs, 10/is and 500/is. The peak of P(t, At)/D{r, 0), 
which signifies the amplitude of correlations between the unbinding events, decays to zero as 
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At is increased from lyus<^7 jz\ to SOO/^s^y/^i. An accurate statistical analysis of unbinding 
lifetimes can be made using the following steps. From the unbinding time histogram P(t, f ext ) 
and the "apparent" mean lifetime r app the joint histogram P(ti,t 2 ; At) for At<^T app is com- 
puted. The difference -D(ti,t 2 ; At) is evaluated using P(t, f ex t) and the experimentally deter- 
mined P(ti, t 2 ; At, fext)- If -D~0, the unbinding events are uncorrelated, and k\ can be estimated 
by fitting Eq. (JTJ) to P(r,f ext ). If -D>0, the unbinding and protein motions are correlated. In 
this case the lifetime measurements must be repeated for longer At. Using the new data, new 
distributions -P(r 1 ), P(ti,t 2 ; At) and D(ti,t 2 ; At) can be calculated. The process is iterated 
until the requirement D^O is satisfied for the compression cycle of duration, say, At*. The 
protein relaxation time tr is the minimum value of At=At* at which D^O. Uncorrelated life- 
times collected for At3>TR^At* can then be binned to obtain the final histogram P(r). If 
T~R < ^T app =TR+T u then r app ~r M , and fcio and a can be estimated by fitting Eq. (0) to P(r, fext)- 
However, if T R ~r app , P(r, f ext ) must be analyzed using Eq. (j3J) for given f ext , L, •y=k B T/DL, 
and estimated r R . Thus, the theory presented here suggests a novel dynamic correlated force 
spectroscopy in which measurements of P{j\, r 2 ; At) for varying At can be used to account for 
the influence of internal protein dynamics on forced unbinding of protein-protein complexes. 
This work was supported by National Science Foundation Grant NSFCHE-05-14056. 
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FIGURE CAPTIONS 



Figure 1 The distribution of unbinding times P(r, f ext ) for WLC (solid) and Rouse model 
(dash-dotted lines) of protein and Poisson approximation P u (r, f ext ) (dashed lines) for f ext = 
lpN, 3pN and lOpN computed for fci~zi/7 (left) and k\<^z\/^ (right panels). 

Figure 2 The joint distribution P(t 1 ,t 2 ; At, f ext ) of lifetimes T\ and r 2 separated by 
At=\\xs (top), 10/is (middle) and 500/is (bottom panels) for f ext =3pN. The contour plots 
of P(ti, r 2 ; At, f ext ) are shown on the right. 

Figure 3 Normalized correlation amplitude D(r, At)/D(r, 0) of equal lifetimes t=t 1 =t 2 
separated by At—l/is (solid), 10/is (dash-dotted) and 500/is (dashed lines) for f ext =3pN. 
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Fig. 1 (V. Barsegov and D. Thirumalai) 
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g. 3 (V. Barsegov and D. Thirumalai) 



